Method and apparatus for measurements of patterned structures

ABSTRACT

A method for measuring at least one desired parameter of a patterned structure having a plurality of features defined by a certain process of its manufacturing. The structure represents a grid having at least one cycle formed of at least two metal-containing regions spaced by substantially transparent regions with respect to incident light defining a waveguide. An optical model is provided, which is based on at least some of the features of the structure defined by a certain process of its manufacturing, and on the relation between a range of the wavelengths of incident radiation to be used for measurements and a space size between the two metal-containing regions in the grid cycle, and a skin depth of said metal. The model is capable of determining theoretical data representative of photometric intensities of light components of different wavelengths specularly reflected from the structure and of calculating said at least one desired parameter of the structure. A measurement area is located and spectrophotometric measurements are applied to the measurement area, by illuminating it with incident light of a preset substantially wide wavelength range. A light component substantially specularly reflected from the measurement area is detected, and measured data representative of photometric intensities of each wavelength within the wavelength range is obtained. The measured and theoretical data are analyzed and the optical model is optimized until the theoretical data satisfies a predetermined condition. Upon detecting that the predetermined condition is satisfied, said at least one parameter of the structure is calculated.

This is a continuation of application Ser. No. 10/919,823, filed Aug. 17, 2004, which is a continuation of application Ser. No. 10/011,263, filed Nov. 13, 2001 now U.S. Pat. No. 6,836,324, which is a continuation-in-part of application Ser. No. 09/605,664, filed Jun. 26, 2000 now U.S. Pat. No. 6,476,920, which is a continuation-in-part of application Ser. No. 09/267,989, filed Mar. 12, 1999 now U.S. Pat. No. 6,100,985, which is a continuation-in-part of application Ser. No. 09/092,378, filed Jun. 5, 1998 now abandoned.

FIELD OF THE INVENTION

This invention is in the field of measurement techniques and relates to a method and a system for measuring the parameters of patterned structures.

BACKGROUND OF THE INVENTION

Techniques for thickness measurements of patterned structures have been developed. The term “patterned structure” used herein, signifies a structure formed with regions having different optical properties with respect to an incident radiation. More particularly, a patterned structure represents a grid having one or more cycles, each cycle being formed of at least two different locally adjacent stacks. Each stack is comprised of layers having different optical properties.

Production of integrated circuits on semiconductor wafers requires maintaining tight control over the dimensions of small structures. Certain measuring techniques enable the local dimensions of a wafer to be measured with relatively high resolution, but at the expense of discontinued use of the wafer in production. For example, inspection using a scanning electron microscope gives measurements of the parameters of a patterned structure, but at the expense of cleaving it and thus excluding it from continued processing. Mass production of patterned structures such as wafers requires a non-destructive process for controlling thin film parameters in a manner enabling the local measurements to be performed.

One kind of the conventional techniques for measuring thickness of thin films is disclosed in U.S. Pat. No. 4,999,014. The technique is based on the use of small spot size and large numerical aperture for measurements on small areas. Unfortunately, in the case of a very small structure, this approach suffers from a common drawback associated, on the one hand, with the use of a small spot-size and, on the other hand, owing to the large numerical aperture, with the collection of high diffraction orders. The term “small spot-size” signifies the spot diameter similar in size to the line or space width of the measured structure, i.e. a single grid cycle. This leads to various problems, which are difficult to solve. Indeed, not all the stacks' layers are in the focus of an optical system used for collecting reflected light, the optical system being bulky and complicated. Detected signals are sensitive to small details of a grid profile and to small deviations in the spot placement. Diffraction effects, which depend significantly on the grid profile and topography and therefore are difficult to model, have to be included in calculations.

Another example of the conventional techniques of the kind specified is disclosed in U.S. Pat. No. 5,361,137 and relates to a method and an apparatus for measuring the submicron linewidths of a patterned structure. The measurements are performed on a so-called “test pattern” in the form of a diffraction grating, which is placed in a test area of the wafer. Here, as in most conventional systems, a monochromatic incident light is employed and diffraction patterns are produced and analyzed. However, a large number of test areas are used and also information on multiple parameters cannot be obtained.

According to some conventional techniques, for example that disclosed in U.S. Pat. No. 5,087,121, portions with and without trenches are separately illuminated with broadband light, the reflection spectrum is measured and corresponding results are compared to each other with the result being the height or depth of a structure. However, it is often the case that the structure under inspection is such that the different portions cannot be separately imaged. This is owing to an unavoidable limitation associated with the diameter of a beam of incident radiation striking the structure.

The above approach utilizes frequency filtering to enable separation of interference signals from different layers. This is not feasible for layers of small thickness and small thickness difference because of a limited number of reflection oscillations.

Yet another example of the conventional technique for implementing depth measurements is disclosed in U.S. Pat. No. 5,702,956. The method is based on the use of a test site that represents a patterned structure similar to that of the wafer (circuit site), but taken in an enlarged scale. The test site is in the form of a plurality of test areas each located in the space between two locally adjacent circuit areas. The test areas are designed so as to be large enough to have a trench depth measured by an in-line measuring tool. The measurements are performed by comparing the parameters of different test areas assuming that the process is independent of feature size. For many processes in the field such as etching and photoresist development, this assumption is incorrect and this method is therefor inapplicable.

SUMMARY OF THE INVENTION

It is a major object of the present invention to overcome the above listed and other disadvantages of the conventional techniques and provide a novel method and system for non-destructive, non-contact measurements of the parameters of patterned structures.

It is a further object of the invention to provide such a method and system that enables the relatively small amount of information representative of the structure's conditions to be obtained and successfully processed for carrying out the measurements, even of very complicated structures.

According to one aspect of the present invention, there is provided a method for measuring at least one desired parameter of a patterned structure, which represents a grid having at least one cycle formed of at least two metal-containing regions spaced by substantially transparent regions with respect to incident light thereby defining a waveguide, the structure having a plurality of features defined by a certain process of its manufacturing, the method comprising the steps of:

-   -   (a) providing an optical model, which is based on at least some         of said features of the structure, on relation between a         wavelength range of the incident light to be used for         measurements and a space size between the two metal-containing         regions in the grid cycle, and a skin depth of said metal, and         is capable of determining theoretical data representative of         photometric intensities of light components of different         wavelengths specularly reflected from the structure and of         calculating said at least one desired parameter of the         structure;     -   (b) locating a measurement area for applying thereto         spectrophotometric measurements, wherein said measurement area         is a grid cycles containing area;     -   (c) applying the spectrophotometric measurements to said         measurement area by illuminating it with incident light of a         preset substantially wide wavelength range, detecting light         component substantially specularly reflected from the         measurement area, and obtaining measured data representative of         photometric intensities of each wavelength within said         wavelength range;     -   (d) analyzing the measured data and the theoretical data and         optimizing said optical model until said theoretical data         satisfies a predetermined condition; and     -   (e) upon detecting that the predetermined condition is         satisfied, calculating said at least one parameter of the         structure.

Thus, the present invention utilizes the features of a patterned structure, whose parameters are to be measured, which are defined by manufacturing steps of a certain technological process completed prior to the measurements, and a relation between the wavelength range of incident light used for measurements and a space size between the two metal-containing regions in the grid cycle, and a skin depth of the metal. Actual design-rule features can often be found in the structure in sets (e.g. read lines in memories). The term “design-rule features” signifies a predetermined set of the allowed pattern dimensions used throughout the wafer. Hence, information regarding the desired parameters can be obtained using super-micron tools such as a large spot focused on a set of lines.

The present invention, as distinct from the conventional approach, utilizes a spectrophotometer that receives reflected light substantially from zero-order. The zero-order signal is not sensitive to small details of the grid profile of the structure such as edge rounding or local slopes. This enables the effects associated with diffracted light not to be considered, and thereby the optical model, as well as the optical system, to be simplified.

In the case of wafers, each element in the grid cycle consists of a stack of different layers. The features of such a structure (wafer), which are dictated by the manufacturing process and should be considered by the optical model, may be representative of the following known effects:

-   -   specular reflection from the different stacks within the grid         cycle;     -   interference of reflected light from layers within each stack;     -   dissipation within transparent stacks due to cavity-like         geometry formed in the grid-like structure;     -   specular contributions due to width of stacks relative to the         wavelength;     -   polarization due to the incident beam interaction with a         conductive grid-like structure, if present;     -   effects due to limited coherence of illumination;     -   interference between light beams reflected from each stack         within the grid cycle, taking into account the above effects.

The contribution of each of the above effects into the theoretical data are estimated in accordance with the known physical laws.

The optical model, being based on some of the features, actually requires certain optical model factors to be considered in order to perform precise calculations of the desired parameters. If information of all the features is not available and the model cannot be optimized prior to the measurements, this is done by means of a so-called initial “learning” step. More specifically, there are some optical model factors which, on the one hand, depend variably on all the features and, on the other hand, define the contribution of each of the existing optical effects into the detected signal. The values of these optical model factors are adjusted along with the unknown desired parameters during the learning step so as to satisfy the predetermined condition. The latter is typically in the form of a merit function defining a so-called “goodness of fit” between the measured and theoretical data. The resulting optical model factors can consequently be used in conjunction with known features to enable precise calculations of the desired parameters of the structure.

Preferably, the measurement area is the part of the structure to be measured. Alternatively, the measurement area is located on a test pattern representative of the actual structure to be measured, namely having the same design rules and layer stacks. The need for such a test pattern may be caused by one of the following two reasons:

-   -   1) If the measurement area is not substantially smaller than the         available surface area defined by the actual structure to be         measured, then the test site is implemented so as to include an         extended structure;     -   2) If the structure is very complicated or consists of ambiguous         under-layer structure, then the test site is implemented with         the same geometry as that of the actual structure to be         measured, but with a simplified under-layer design, thus         allowing simplified measurements of the top layers.

According to another aspect of the present invention, there is provided an apparatus for measuring at least one desired parameter of a patterned structure that represents a grid having at least one grid cycle formed of at least two metal-containing regions spaced by substantially transparent regions with respect to incident light defining a waveguide, the structure having a plurality of features defined by a certain process of its manufacturing, the apparatus comprising:

-   -   a spectrophotometer illuminating a measurement area by incident         light of a preset substantially wide wavelength range and         detecting a specular reflection light component of light         reflected from the measurement area for providing measured data         representative of photometric intensities of detected light         within said wavelength range; and     -   a processor unit coupled to the spectrophotometer, the processor         unit comprising a pattern recognition software and a translation         means so as to be responsive to said measured data and locate         measurements, the processor being operable for         -   applying an optical model, based on at least some of said             features of the structure, on relation between wavelength             range of the incident light to be used for measurements and             a space size between the two metal-containing regions in the             grid cycle, and a skin depth of said metal, for providing             theoretical data representative of photometric intensities             of light specularly reflected from the structure within said             wavelength range and calculating said at least one desired             parameter, and         -   comparing said measured and theoretical data and detecting             whether the theoretical data satisfies a predetermined             condition.

Preferably, the spectrophotometer is provided with an aperture stop accommodated in the optical path of the specular reflected light component. The diameter of the aperture stop is set automatically according to the grid cycle of the measured structure.

Preferably, the incident radiation and the reflected light received by the detector are directed along substantially specular reflection axes.

More particularly, the invention is concerned with measuring height/depth and width dimensions on semiconductor wafers and is therefore described below with respect to this application.

BRIEF DESCRIPTION OF THE DRAWINGS

In order to understand the invention and to see how it may be carried out in practice, a preferred embodiment will now be described, by way of non-limiting example only, with reference to the accompanying drawings, in which:

FIGS. 1 a and 1 b are, respectively, schematic cross-sectional and top views of one kind of a patterned structure to be measured;

FIG. 2 schematically illustrates the main components of an apparatus according to the invention for measuring the parameters of a patterned structure;

FIG. 3 is a graphical illustration of the main principles of the present invention, showing the relationship between measured and theoretical data obtained by the apparatus of FIG. 2;

FIG. 4 illustrates yet another example of a patterned structure to be measured with the apparatus of FIG. 3;

FIGS. 5 a and 5 b illustrate a flow diagram of the main steps of a method according to the invention;

FIGS. 6 to 10 are schematic cross-sectional views of five more examples of patterned structures suitable to be inspected by the apparatus of FIG. 2;

FIGS. 11 and 12 illustrate two examples, respectively, of patterned structures to be measured, wherein one-dimensional periodicity in layers is considered;

FIGS. 13A–13C illustrate a two-dimensional structure to be measured, relating to DRAM applications, wherein FIG. 13A is a top view of the structure, and FIGS. 13B and 13C are cross-sectional views taken along lines A—A and B—B, respectively, in FIG. 13A; and

FIG. 14 illustrates a patterned structure for use in yet another example of the present invention

DETAILED DESCRIPTION OF A PREFERRED EMBODIMENT

Referring to FIGS. 1 a and 1 b, there are partly illustrated a cross-section and a top view, respectively, of a grid-like wafer structure, generally designated 10, whose parameters are to be measured. The structure is formed of a plurality of cells, generally at 12, each constituting a grid cycle. Only three adjacent cells 12 are demonstrated in the present example with only two stacks (or elements) in each cell in order to simplify the illustration. Thus, the cell 12 comprises two stacks 12 a and 12 b formed of different layers. More specifically, the stack 12 a includes six layers L₁–L₆, wherein the layers L₁ and L₂ and the layer L₆ form two layers L₁ and L_(2,6), respectively, of the stack 12 b. As known in the conventional semiconductor devices, semiconductor structures such as sources, drains and gate electrodes, capacitors, etc. are formed in and on a semiconductor substrate (layer L₁) typically made of silicon material and including metal conductors (e.g. aluminum). The substrate is coated by an insulating silicon oxide compound (layer L₂). The first level metal layer L₄ (and the single level in the present example) is formed, being interposed between top and bottom barrier layer L₃ and L₅ made of titanium nitride (TiN). Deposition coating of an uppermost insulating silicon oxide layer L₆ and subsequent chemical mechanical polishing (CMP), consisting of thinning the uppermost layer L₆, completes the manufacturing. The construction of such a structure and method of its manufacturing are known per se and therefore need not be more specifically described.

According to this specific example, the parameters to be measured are the widths W₁ and W₂ of the stacks 12 a and 12 b and depths d₁ and d₂ of the uppermost silicon oxide layers L₆ and L_(2,6), respectively. It is appreciated that any other parameters of the patterned structure such as, for example, materials and their optical properties, can be measured.

Reference is now made to FIG. 2 illustrating a system, generally designated 14, suitable for carrying out the measurements. The system 14 may represent one of the working stations of a production line (not shown), the wafers 10 progressing between upstream and downstream stations of the production line. The system 14 comprises a support frame 16 for holding the structure 10 within an inspection plane, a spectrophotometer 18 and a processor unit 20 connected thereto. The spectrophotometer 18 typically includes a light source 22 for generating a beam of light 24 of a predetermined wavelength range, light directing optics 26 and a detector unit 28. The light directing optics 26 are typically in the form of a beam deflector comprising an objective lens 30, a beam splitter 32 and a mirror 34. The detector unit 28 typically comprises an imaging lens 36, a variable aperture stop 38 coupled to and operated by its motor 40 and a spectrophotometric detector 42. The construction and operation of the spectrophotometer 18 may be of any known kind, for example, such as disclosed in U.S. Pat. No. 5,517,312 assigned to the assignee of the present application. Therefore, the spectrophotometer 18 need not be more specifically described, except to note the following.

The light beam 24 passes through the light directing optics 26 and impinges onto the structure 10 at a certain location defining a measurement area S₁. Light component 44 specularly reflected from the reflective regions within the area S₁ is directed onto the detector unit 28.

It should be noted that, generally, the illuminated location of the structure may be larger than the measurement area S₁, in which case suitable optics are provided for capturing, in a conventional manner, light reflected solely from the part (area S₁) within the illuminated location. In other words, the measurement area being of interest is included into a spot-size provided by the light beam 24 when impinging onto the structure 10. In order to facilitate understanding, assume that the illuminated area defined by the diameter of the incident beam constitutes the measurement area S₁.

The light directing optics 26 and detector unit 28 are designed such that only a zero-order light component of light reflected from the structure 10 is sensed by the spectrophotometric detector 42. The construction is such that the incident and detected light beams are directed substantially parallel to each other and substantially perpendicular to the surface of the structure 10. The diameter of the aperture stop 38 is variable and is set automatically according to the grid cycle of the measured structure. Generally speaking, the diameter of the aperture stop is optimized to collect the maximum reflected intensity excluding diffraction orders.

Additionally, the diameter of the incident beam 24, defining the measurement area S₁, is substantially larger than the surface area S₀ defined by the cell 12, that is: S₁>S₀ According to this specific example, the patterned structure 10 is a so-called “one-dimensional” structure. As clearly seen in FIG. 1 b, the stacks 12 a and 12 b are aligned along the X-axis, while along the Y-axis the stacks continue to infinity (uniform structure) with respect to the measurement area S₁. In other words, the measurement area S₁ includes a structure that has one or more grid cycles extending along the X-axis and is uniform along the Y-axis.

The whole surface area S of the structure under inspection should be substantially larger than the measurement area S₁ defined by the diameter of the incident beam. S>S₁

The case may be such that the above conditions are not available in the structure 10. For example, the structure may contain a single grid cycle. To this end, the measurement area S₁ consisting of more than one cell 12 should be located on a test-site (not shown).

For example, if the system 14 provides the numerical aperture of 0.2 and spot-diameter (measurement area S₁) about 15μm, the minimum surface area S of a test-site should be 20μm. NovaScan 210 spectrophotometer, commercially available from Nova Measuring Instruments Ltd., Israel, may be used in the system 14.

The spectrophotometer 18 measures the photometric intensities of different wavelengths contained in the detected, zero-order light component of the reflected beam 44. This is graphically illustrated in FIG. 3, being shown as a dashed curve D_(m) constituting the measured data. The processor unit 20 comprises a pattern recognition software and a translation means so as to be responsive to the measured data and locate measurements. It is pre-programmed by a certain optical model based on at least some features of the structure for calculating theoretically the photometric intensities of light of different wavelengths reflected from a patterned structure. This is shown in FIG. 3 as a solid curve D_(t), constituting theoretical data

In order to design the optical model capable of estimating all the possible optical effects, which are dictated by the features of the structure to be measured and affect the resulting data, the following should be considered.

Generally, total specular reflection R from the grid-like structure is formed of a coherent part R_(coh) and an incoherent part R_(incoh). It is known that coherence effects play an essential role in the measurements when a wide bandwidth radiation is used. The coherence length L of light in the optical system is determined by the radiation source and by the optical system (spectrophotometer) itself. Reflection amplitudes from structure's features smaller than the coherence length interact coherently, producing thereby interference effects between light reflected by different stacks of the cell. For larger features, a non-negligible portion of light reflected by different stacks undergoes incoherent interaction without producing interference. The coherence length L defines a mutual coherence ν of light, coming from points separated by half a cycle of the grid structure, and, consequently, defines the degree of coherence γ, that is:

$\begin{matrix} {L = {D \cdot \lambda}} \\ {v = \frac{2 \cdot \pi \cdot \left( {W_{1} + W_{2}} \right)}{2 \cdot L}} \\ {y = \left( \frac{2 \cdot {J_{1}(v)}}{v} \right)^{2}} \end{matrix}$ wherein D is a variable parameter determined experimentally for the actual optical system and stack structure based on the measured reflection spectra (measured data) for grids of varied cycle dimensions; J₁ is a known Bessel function. An approximate initial input for the determination of the parameter D may be given by nominal optical system characteristics. Hence, the total specular reflection R is given: R=γ·R _(coh)+(1−γ)·R _(incoh)

In order to estimate the possible optical effects affecting the above parts of the total reflected signal, the following main factors should be considered, being exemplified with respect to the patterned structure 10 (FIGS. 1 a and 1 b):

1) Filling factors a₁ and a₂:

$\begin{matrix} {\alpha_{1} = \frac{W_{1}}{W_{1} + W_{2}}} \\ {\alpha_{2} = \frac{W_{2}}{W_{1} + W_{2}}} \end{matrix}$ These factors represent the zero-order contribution, which is based only on the ratio of the areas of stacks 12 a and 12 b, respectively, in the reflection calculation. The zero-order signal is not sensitive to small details of the grid profile of the structure 10 such as edge rounding or local slopes. Therefore, the effects associated with diffracted light may not be considered.

2) Size coupling factors c₁ and c₂:

When the width of the stack is close to the wavelength, the filling factors a₁ and a₂ should be corrected for reducing the coupling of the incident radiation to the respective stack. To this end, so-called “coupling factors” c₁ and c₂ should be introduced to the filling factors a₁ and a₂, respectively. The coupling factor gives a negligible effect when the width of the stack is relatively large relative to the wavelength and negates the interaction completely when the stack width is much smaller than the wavelength. Using a heuristic exponential function to give this dependence, the coupling factors are as follows:

$\begin{matrix} {c_{1} = {\exp\left\{ {{{- A} \cdot \exp}\frac{\lambda}{W_{1}}} \right\}}} \\ {c_{2} = {\exp\left\{ {{{- A} \cdot \exp}\frac{\lambda}{W_{2}}} \right\}}} \end{matrix}$ wherein λ is the wavelength of a respective light component; A is a variable factor depending on the dimensions and materials of the structure and is determined experimentally for the actual stack structure, as will be described further below.

3) Dissipation b₂ in cavity-like structures:

It is often the case that one of the stacks is essentially dissipative owing to geometrical effects reducing reflection, which effects typically take place in cavity-like structures. Among these geometrical effects are high aspect-ratio trenches and wave-guiding underneath metal grid-like structures. High aspect-ratio structures are characterized by a dissipative effect that decreases the amount of light reflected back out with phase impact. For example, multiple reflections in deep grooves in metal both reduces the amount of light reflected back out and destroys the phase relation. The above effects are relatively strong for deep geometry and relatively weak for shallow structures (relative to the wavelength). Using a heuristic exponential function to give this dependence, a dissipation factor b₂ is given:

$b_{2} = {\exp\left\{ {{- B} \cdot \frac{d_{2}}{\lambda}} \right\}}$ wherein B is a variable size parameter, which is determined experimentally for the actual stack structure; d₂ is the depth of the cavity-like part of the stack. Here, by way of example only, the stack 12 b is defined as a dissipative one.

In order to model the corrected filling factors, it is assumed that light radiation not reflected from a certain cell's stack from coupling considerations is essentially reflected by other cell's stack(s). The dissipation factor b₂ is taken into account in the reduced effective filling factor of the geometrically dissipative area. Hence, the corrected filling factors are as found: A ₁ =a ₁ ·c ₁ +a ₂·(1−c ₁) A ₂=(a ₂ ·c ₂ +a ₁·(1−c ₁))·b ₂

4) Polarization factors, representing the contribution of polarization effects that may take place in the case of metallic grids:

When the width of a cell's stack is close to the wavelength, a corrective factor should be introduced for reducing the coupling of the incident TE radiation to the respective stack owing to boundary conditions at the edges of metal lines. The polarization factor gives a negligible effect when the width of the stack is large relative to the wavelength and negates the reflection completely when the stack width is much smaller than the wavelength. Hence, the polarization factors p₁ and p₂ are given:

$\begin{matrix} {p_{1} = {\exp\left\{ {{- C}\frac{\lambda}{W_{1}}} \right\}}} \\ {p_{2} = {\exp\left\{ {{- C}\frac{\lambda}{W_{2}}} \right\}}} \end{matrix}$ wherein C is a variable parameter determined experimentally for the actual stack structure. It is appreciated that in the absence of a pattern formed of metal lines, the optical factor C is equal to zero.

Similarly, in order to model the corrected filling factors, it is assumed that light radiation not reflected from a certain cell's stack from polarization considerations is essentially reflected by other cell's stack(s). Hence, the corrected filling factors are as found: A ₁ =a ₁ ·c ₁ ·p ₁ +a ₂·(1−c ₂ ·p ₂) A ₂=(a ₂ ·c ₂ ·p ₂ +a ₁·(1−c ₁ ·p ₁))·b ₂

The intensity of a reflected signal τ(λ) from each stack is calculated using layer thickness information and material optical parameters (constituting known features). To this end, standard equations for reflection from multi-layered stacks are used, based on Fresnel coefficients for reflection and transmission at interfaces as a function of wavelength for perpendicular incidence. The thickness for each layer is either known (being provided by the user) or calculated internally by the program. The materials of the layers and, therefore, their optical parameters, such as refraction indices and absorption, are known or calculated.

In view of the above and considering that both the coherent and incoherent parts contain contributions from two polarizations (e.g. R_(coh)=R^((p))+R^((s))), the total reflection R_(τoτ)constituting the theoretical data obtained by the optical model, is given:

$R_{TOT} = {{\left\{ {{{{r_{1} \cdot A_{1P}} + {r_{2} \cdot A_{2P}}}}^{2} + {{{r_{1} \cdot A_{1S}} + {r_{2} \cdot A_{2g}}}}^{2}} \right\} \cdot \frac{\gamma}{2}} + {\left\{ {{{r_{1}}^{2} \cdot A_{1P}^{2}} + {{r_{2}}^{2} \cdot A_{2P}^{2}} + {{r_{1}}^{2} \cdot A_{1S}^{2}} + {{r_{2}}^{2} \cdot A_{2S}^{2}}} \right\} \cdot \frac{1 - \gamma}{2}}}$ wherein τ₁ and τ₂ are the amplitudes of reflection from first and second stacks, respectively, of the cell, that is stacks 12 a and 12 b in the present example.

Other effects known in common practice (such as lateral reflection, roughness, etc.) have been found to have a negligible contribution under the defined conditions and are accounted for by the adjustment of the parameters A, B, C and D. turning back to FIG. 3, there is clearly illustrated that the curves D_(m) and D_(t) do not coincide, that is the theoretical data does not exactly match the measured data. A suitable merit function is used for determining the goodness of fit of the obtained results. By setting the values of the optical model parameters A, B, C and D the optical model is defined. By fitting the values of the desired parameters, e.g. W₁, W₂, d₁ and d₂, the theoretical data is optimized until the goodness of fit reaches a certain desired value (constituting a required condition). Upon detecting that the optimized theoretical data satisfies the required condition, the desired parameters of the structure, i.e. the W₁, W₂, d₁ and d₂ are calculated from the above equations.

It should be noted that in the most general case, when the grid cycle comprises two or more locally adjacent different elements (e.g., stacks), the above optical model is still correct. The mutual coherence ν′ is as follows:

$v^{\prime} = {\frac{\pi}{L}{\sum\limits_{i = 1}^{n}\; W_{i}}}$ wherein i is the i-th element (stack) in the grid cycle; n is the total number of elements within the grid cycle, and L is the coherence length. For the main factors on which the above optical model is based, we have:

-   -   Filling Factor

$a_{1} = \frac{W_{i}}{\sum\limits_{i = 1}^{n}\; W_{i}}$

-   -   Coupling Factor

$c_{i} = {\exp\left( {{{- A} \cdot \exp}\frac{\lambda}{W_{1}}} \right)}$

-   -   Dissipation Factor

$b_{m} = {\exp\left\{ {{- B_{m}}\frac{d_{m}}{\lambda}} \right\}}$ wherein m is the number of a dissipative element of the n stacks; d_(m) is the depth of the cavity-like part of the stack in relation to the neighboring stacks. For a non-dissipative stack, b_(n)=1, wherein n≠m.

-   -   Polarization Factor

$p_{i} = {\exp\left\{ {{- C} \cdot \frac{\lambda}{W_{i}}} \right\}}$

-   -   Corrected Filling Factor

$A_{j} = {b_{i} \cdot \left\lbrack {{a_{1} \cdot c_{1} \cdot p_{1}} + {\sum\limits_{j = {1{({j \neq i})}}}^{n}\;{a_{j}\left( {1 - {c_{j} \cdot p_{j}}} \right)}}} \right\rbrack}$

In view of the above, the total reflection R¹ _(TOT) is as follows:

${R_{TOT}^{\prime} = \left\{ {{{\sum\limits_{i}\left( {r_{i} \cdot A_{ip}} \right)}}^{2} + {{\sum\limits_{i}\left( {r_{i}{\cdot A_{ix}}} \right)}}^{2}} \right\}}{{\cdot \frac{\gamma}{2}} + \mspace{85mu}{\left\{ {{\sum\limits_{i}^{\;}\;\left( {{r_{i}}^{2}A_{ip}^{2}} \right)} + {\sum\limits_{i}\left( {{r_{i}}^{2} \cdot A_{ii}^{2}} \right)}} \right\} \cdot \frac{1 - \gamma}{2}}}$

Referring to FIG. 4, there is illustrated a part of a so-called “two-dimensional” structure 100, i.e. a structure periodical in both X- and Y-axes. This structure 100 is characterized by a plurality of grid cycles aligned along both the X- and Y-axes. The cycle aligned along the X-axis is formed of a pair of elements W₁ and W₂ (the stacks' widths), and the cycle aligned along the Y-axis is formed of a pair of elements G₁ and G₂ (the stacks' lengths). For example, the elements G₁ and G₂ may be, respectively, a metal layer stack and a block of Inter Layer Dielectric (ILD) stack. The measurement area S₁ defined by the diameter of the incident beam includes at least one cycle in X-direction and at least one cycle in Y-direction (several cycles in the present example).

Generally speaking, the cycle in either X- or Y-axis may be composed of several elements (e.g., stacks). If the measurement area S₁ is smaller than the surface area defined by the grid cycle along one of the axes X or Y, the total reflection (theoretical data) is determined in the manner described above with respect to the one-dimensional structure 10 (FIGS. 1 a and 1 b). If the measurement area is larger than the surface area defined by the grid cycle along both X- and Y-axes, then for the total reflection R_(2D) of such a 2D-structure we have:

$R_{2 - D} = {{\frac{G_{1}}{G_{1} + G_{2}} \cdot R_{G_{1}}} + {\frac{G_{2}}{G_{1} + G_{2}} \cdot R_{G_{2}}}}$ wherein R_(G1) and R_(G2) are the intensities of reflection signals from the two one-dimensional structures aligned along the Y-axis and having the widths G₁ and G₂, respectively. It should be noted that the Y-axis is no more than a notation, i.e. has no physical significance, and can be exchanged with the X-axis. For the general case of k elements in the cycle aligned along the Y-axis, we have:

$R_{2D} = \frac{\sum\limits_{i = 1}^{k}\;{G_{i} \cdot R_{G_{i}}}}{\sum\limits_{i = 1}^{k}\; G_{1}}$ wherein R_(Gi) and G_(i) are the reflection intensity from and width of the i-th element.

In general the axis location for calculating the reflection intensities R_(Gi) is chosen so as to satisfy the following: G₁ +G ₂>W₁+W₂ If the above condition is not satisfied, than the two axes are exchanged accordingly.

The main principles of a method according to the invention will now be described with reference to FIGS. 5 a and 5 b. The structure of the required measurement area is examined (step 46) so as to determine whether the above measurement area condition is satisfied within the existing pattern (step 48). If this condition is not satisfied, a test-site structure satisfying the condition is designed on the reticle (step 50), the test-site being typically provided within a so-called “margin region”.

Then, an initial learning mode of operation is performed, generally at step 52. The learning mode is aimed, on the one hand, at providing the measured data and, on the other hand, at optimizing the optical model. During the learning mode, the system 14 operates in the manner described above for detecting light reflected from the illuminated area substantially at zero-order and obtaining the measured data in the form of photometric intensities of each wavelength within the wavelength range of the incident radiation (step 54). Concurrently, the processor 20 applies the above optical model for obtaining the theoretical data (step 56) and compares it to the measured data (step 58). The optical model is based on some known features of the structure and nominal values of unknown features (i.e. of the desired parameters to be measured) which are provided by the user. At this stage, the relation between the theoretical data and the measured data is compared to a certain condition (step 62). If the condition is satisfied then, correct values of the parameters A, B, C and D are calculated (step 64) and an optimized optical model is obtained (step 66). If the condition is not satisfied then the optical model factors A, B, C and D and the unknown features are adjusted (step 60) until the condition is satisfied. It should be noted, although not specifically illustrated, that at this initial learning stage, the desired parameters can be calculated.

Thereafter the measurement mode of operation is performed, generally at step 68. To this end, the measured and theoretical data are concurrently produced (steps 70 and 72, respectively). It is appreciated that the theoretical data now produced is based on the known parameters of the structure, previously calculated correct values of the optical factors A, B, C and D and on the nominal values of the desired parameters to be measured. Similarly, the optimized theoretical data is compared to the measured data so as to determine whether or not the theoretical data satisfies a required condition (step 74), e.g. the goodness of fit is of a desired value. If so, the desired parameters are calculated (step 76) and if not the desired parameters are adjusted (step 78) until the theoretical data substantially matches the measured data. If desired, the measurement mode (step 68) is then repeated for inspecting a further location on the structure 10 (step 80).

Referring to FIGS. 6 and 7, there are illustrated in a self-explanatory manner two examples of patterned structures, designated 110 and 210, respectively, which can be inspected in the above described manner by the system 14. Each of the structures 110 and 210 consists of cells 112 and 212, respectively, each cell including two stacks formed of different layers. The parameters to be measured in these structures are, respectively, the width of a photoresist layer on top of the aluminum and the depth of the etched area (Air) within the silicon oxide layer.

Referring to FIGS. 8 and 9, there are illustrated two more example of patterned structures, designated 310 and 410, respectively, whose parameters can be measured in accordance with the invention. Here, the parameters to be measured are, respectively, the width and thickness of an aluminum layer on top of the silicon oxide and the remaining thickness of the metallic layer on the silicon oxide layer undergoing chemical mechanical polishing.

It is appreciated that polarization effects are present in the structures 310 and 410 due to the existence of patterned metal in both structures, while being weak in the structures 110 and 210.

FIG. 10 illustrates a patterned structure 510 utilizing copper under and between any two S_(i)O₂-based layers known as Interlayer Dielectric (ILD) insulating layer. CMP process applied to such a copper-based structure 510 typically results in copper loss portions, generally at P_(i), a so-called “dishing” effect. This effect is associated with the properties of copper (e.g., softer nature as compared to other metals) and the chemical nature of the copper-based CMP process. The parameters to be measured are the depths d₁ and d₁ of, respectively, the uppermost ILD insulating layer and the dishing-associated portion P_(i). In certain cases, depending on the layer stacks, the metal thickness can be determined by (d₁−d₂).

It has been found by the inventors that the measurements can be even more optimized by taking into account the relation between the wavelength λ of incident light and the pitch size of the patterned structure (i.e., Λ=W₁+W₂ in the above examples), when selecting an optical model to be used. The above-described optical model is the optimal one for the case when Λ>λ.

Reference is now made to FIG. 11 showing a patterned structure 610 formed on a substrate layer L₁. The structure 610 is composed of a plurality of cells (grid cycles) 612, two such cells being completely shown in the figure, and can for example be formed of straight lines of metal in dielectric matrix. In this specific example, each grid cycle 612 comprises three stacks 612 a, 612 b and 612 c. Each stack is formed of different K layers having different dielectric permittivity ε. More specifically, the stack 612 a includes K layers with the dielectric permittivities ε₁(1), ε₁(2), . . . , ε₁(K), respectively; the stack 612 b includes K layers with the dielectric permittivities ε₂(1), ε₂(2), . . . , ε₂(K), respectively; and the stack 612 c includes K layers with the dielectric permittivities ε₃(1), ε₃(2), . . . , ε₃(K), respectively. An ambient, “superstrate” layer L₀ is considered as the upper layer of the structure 610.

Thus, the entire structure to be measured is formed of j layers including the 0-th superstrate layer L₀, K stack layers, and the lower substrate layer L₁, that is, j=K+2. Index j=0 corresponds to the superstrate layer L₀, and index j=K+1 corresponds to the substrate layer L₁.

In the present example, the normal incidence of light onto the patterned structure 612 (e.g., straight lines of metal in dielectric matrix) is considered. Here, the Z-axis is perpendicular to the surface of the structure (i.e., parallel to the direction of propagation of incident light towards the structure), the X-axis is perpendicular to the lines of metal (i.e., elements of the pattern), and the Y-axis is parallel to the metal lines.

In the most general case, the structure 612 has N stacks (n=1, . . . , N), each with K layers, the width of the n-th stack being ΔX_(n). The structure pitch Λ_(X) is thereby determined as follows:

$\Lambda_{x} = {\sum\limits_{n = 1}^{N}\;{\Delta\; X_{n}}}$

The interaction of the incident light with each layer can be described using the effective permittivity tensor:

$ɛ = \begin{pmatrix} ɛ_{X} & 0 \\ 0 & ɛ_{Y} \end{pmatrix}$

The above tensor describes the case of relatively small values of pitch Λ_(X) (the period of grating along the X-axis), as compared to the wavelength of incident light λ, i.e., Λ_(X)/λ<1. The tensor components ε_(X) and ε_(Y) correspond to the electric field vector parallel to the X-axis and Y-axis, respectively (i.e., perpendicular (TM) and parallel (TE) to the metal lines, respectively).

Keeping in mind that the structure under measurements is composed of a plurality of layers, the components of the effective permittivity tensor, ε_(X)(j) and ε_(Y)(j), in j-th layer have the form:

$\begin{matrix} {\left\lbrack {ɛ_{X}(j)} \right\rbrack^{- 1} = {\sum\limits_{n = 1}^{N}\;{\left\lbrack {ɛ_{n}(j)} \right\rbrack^{- 1}\frac{\Delta\; X_{n}}{\Lambda_{X}}}}} \\ {{ɛ_{Y}(j)} = {\sum\limits_{n = 1}^{N}\;{{ɛ_{n}(j)}\frac{\Delta\; X_{n}}{\Lambda_{X}}}}} \end{matrix}$ wherein ε_(n)(j) is the permittivity of j-th layer in n-th stack; ΔX_(n) is the width of the n-th stack; N is the number of stacks.

The total reflectivity of the structure R_(TOT) the form: R _(τOτ) =ψ|R _(x)(0)|²+(1−ψ)|R _(Y)(0)|² wherein ψ describes the polarization of light ψ=0 for light polarized along Y-axis, ψ=1 for light polarized along the X-axis, ψ=0.5 for unpolarized light; R_(x)(0) and R_(Y)(0) are reflectivity amplitudes of the entire structure along the X- and Y-axis, respectively, which are functions of the effective permittivity.

The reflectivity amplitudes R_(x)(j) and R_(Y)(j) can be determined using the following recurrent expressions:

${R_{X}(j)} = \frac{{r_{X}(j)} + {{R_{X}\left( {j + 1} \right)}{\exp\left\lbrack {{- 2}{\mathbb{i}}\;{\sigma_{X}\left( {j + 1} \right)}} \right\rbrack}}}{1 + {{r_{X}(j)}{R_{X}\left( {j + 1} \right)}{\exp\left\lbrack {{- 2}{\mathbb{i}}\;{\sigma_{X}\left( {j + 1} \right)}} \right\rbrack}}}$ for  j = K, K − 1, … , 1, 0. ${R_{Y}(j)} = \frac{{r_{Y}(j)} + {{R_{Y}\left( {j + 1} \right)}{\exp\left\lbrack {{- 2}{\mathbb{i}}\;{\sigma_{Y}\left( {j + 1} \right)}} \right\rbrack}}}{1 + {{r_{Y}(j)}{R_{Y}\left( {j + 1} \right)}{\exp\left\lbrack {{- 2}{\mathbb{i}}\;{\sigma_{Y}\left( {j + 1} \right)}} \right\rbrack}}}$ for   j = K, K − 1, … , 1, 0. wherein τ_(x)(j) and τ_(Y)(j) are reflectivity amplitudes of each of the j layers, σ_(x)(j) and σ_(Y)(j) are complex coefficients showing both the attenuation and phase shift of the TM or TE light within the j-th layer, and are determined as follows:

$\begin{matrix} {{r_{X}(j)} = \frac{\sqrt{ɛ_{X}(j)} - \sqrt{ɛ_{X}\left( {j + 1} \right)}}{\sqrt{ɛ_{X}(j)} - \sqrt{ɛ_{X}\left( {j + 1} \right)}}} & {{r_{Y}(j)} = \frac{\sqrt{ɛ_{Y}(j)} - \sqrt{ɛ_{Y}\left( {j + 1} \right)}}{\sqrt{ɛ_{Y}(j)} - \sqrt{ɛ_{Y}\left( {j + 1} \right)}}} \\ {{\sigma_{X}(j)} = {\frac{2\pi}{\lambda}{d(j)}\sqrt{ɛ_{X}(j)}}} & {{\sigma_{Y}(j)} = {\frac{2\pi}{\lambda}{d(j)}\sqrt{ɛ_{Y}(j)}}} \end{matrix}$ In the above equations, d(j) is the thickness (depth) of the j-th layer

The reflectivity amplitudes τ_(x)(j) and τ_(Y)(j) do not take into account the interference of waves reflected from different layers. They describe the reflectivity from the interface between the j-th and (j+1)-th substances only. In other words, they correspond to the reflectivity from the interface of two semi-infinite volumes with the permittivities ε_(x)(j) and ε_(x)(j+1) in the case of TM polarization, and with the permittivities ε_(Y)(j) and ε_(Y)(j+1) for TE polarization.

On the other hand, the reflectivity amplitudes R_(x)(j) and R_(Y)(j) describe the reflectivity from the (j+1)-th layer with taking into account the interference of the waves reflected from the interfaces between the different layers. Thus, R_(x)(0) and R_(Y)(0) correspond to the reflectivity from the upper layer L₀ of the measured structure for TM and TE polarizations, respectively.

As indicated above, the complex coefficients σ_(x)(j) and σ_(Y)(j) show both attenuation and phase shift of the TM or TE light within the j-th layer: the real part of σ describes the phase shift and the imaginary part of σ describes the attenuation coefficient.

For the reflectivity amplitudes, the complex coefficients, and permettivity of the 0-th and (K+1)-th layers, we have:

R_(X)(K + 1) = 0 R_(Y)(K + 1) = 0 σ_(X)(K + 1) = 0 σ_(Y)(K + 1) = 0 ε_(X)(0) = ε_(L0) ε_(Y)(0) = ε_(L0) ε_(X)(K + 1) = ε_(L1) ε_(Y)(K + 1) = ε_(L1)

Let us generalize this approach to take into account one more possible case of so-called “middle pitches” with respect to the wavelength of incident light, i.e., Λ˜λ. In this case, the reflectivity amplitudes are functions of the effective permittivity of each n-th stack in j-th layer. The effective permettivity tensor has the form:

${ɛ\left( {j,n} \right)} = \begin{pmatrix} {ɛ_{X}\left( {j,n} \right)} & 0 \\ 0 & {ɛ_{Y}\left( {j,n} \right)} \end{pmatrix}$ wherein ε_(x)(j,n) and ε_(Y)(j,n) are as follows: εx(j,n)=εx(j)+α(λ)[εn(j)−εx(j)] εY(j,n)=εY(j)+α(λ)[εn(j)−εY(j)] Here, α(λ) is the coefficient, which is the monotonically decreasing function of wavelength of incident light λ, and is indicative of the effect of “mixing” of the two limiting cases Λ>λ and Λ<λ; α=0 for the case of Λ<λ, α=1 for the case Λ>λ, and 0<α<1 for the case of Λ˜λ.

Using the analogous formulas for each stack (rather than each layer), the total reflectivity R_(τoτ)can be expressed as follows:

$R_{TOT} = {{\psi{{\sum\limits_{n = 1}^{N}\;{{R_{Y}\left( {0,n} \right)}\frac{{\Delta\; X_{n}}\;}{\Lambda_{X}}}}}^{2}} + {\left( {1 - \psi} \right){{\sum\limits_{n = 1}^{N}\;{{R_{Y}\left( {0,n} \right)}\frac{\Delta\; X_{n}}{\Lambda_{X}}}}}^{2}}}$ wherein N is the number of stacks; R_(x)(j,n) and R_(Y)(j,n) are the reflectivity amplitudes for n-th stack, and can be obtained using the previous recurrent equations, but utilizing the values of ε_(x)(j,n) and ε_(Y)(j,n) instead of values of ε_(x)(j) and ε_(Y)(j), respectively.

The reflectivity amplitudes R_(x)(j,n) and R_(Y)(j,n) can be obtained using the following recurrent expressions:

${R_{X}\left( {j,n} \right)} = \frac{{r_{X}\left( {j,n} \right)} + {{R_{X}\left( {{j + 1},n} \right)}{\exp\left\lbrack {{- 2}{\mathbb{i}}\;{\sigma_{X}\left( {{j + 1},n} \right)}} \right\rbrack}}}{1 + {{r_{X}\left( {j,n} \right)}{R_{X}\left( {{j + 1},n} \right)}{\exp\left\lbrack {{- 2}{\mathbb{i}}\;{\sigma_{X}\left( {{j + 1},n} \right)}} \right\rbrack}}}$ for   j = K, K − 1, … , 1, 0 ${R_{Y}\left( {j,n} \right)} = \frac{{r_{Y}\left( {j,n} \right)} + {{R_{Y}\left( {{j + 1},n} \right)}{\exp\left\lbrack {{- 2}{\mathbb{i}}\;{\sigma_{Y}\left( {{j + 1},n} \right)}} \right\rbrack}}}{1 + {{r_{Y}\left( {j,n} \right)}{R_{Y}\left( {{j + 1},n} \right)}{\exp\left\lbrack {{- 2}{\mathbb{i}}\;{\sigma_{Y}\left( {{j + 1},n} \right)}} \right\rbrack}}}$ for   j = K, K − 1, … , 1, 0 ${r_{X}\left( {j,n} \right)} = \frac{\sqrt{ɛ_{X}\left( {j,n} \right)}\; - \sqrt{ɛ_{X}\left( {{j + 1},n} \right)}}{\sqrt{ɛ_{X}\left( {j,n} \right)} - \sqrt{ɛ_{X}\left( {{j + 1},n} \right)}}$ ${r_{Y}\left( {j,n} \right)} = \frac{\sqrt{ɛ_{Y}\left( {j,n} \right)} - \sqrt{ɛ_{Y}\left( {{j + 1},n} \right)}}{\sqrt{ɛ_{Y}\left( {j,n} \right)} - \sqrt{ɛ_{Y}\left( {{j + 1},n} \right)}}$ ${\sigma_{X}\left( {j,n} \right)} = {\frac{2\pi}{\lambda}{d(j)}\sqrt{ɛ_{X}\left( {j,n} \right)}}$ ${\sigma_{Y}\left( {j,n} \right)} = {\frac{2\pi}{\lambda}{d(j)}\sqrt{ɛ_{Y}\left( {j,n} \right)}}$

As indicated above, the reflectivity amplitudes τ_(x)(j,n) and τ_(Y)(j,n) do not take into account the interference of the waves reflected from different layers. They describe the reflectivity from the interface between the j-th and (j+1)-th substances only. In other words, they correspond to the reflectivity from the interface of two semi-infinite volumes with the permittivities ε_(x)(j,n) and ε_(x)(j+1, n) in the case of TM polarization, and with the permittivities ε_(Y)(j, n) and ε_(Y)(j+1, n) in the case of TE polarization. The reflectivity amplitudes R_(x)(j,n) and R_(Y)(j,n) describe the reflectivity from the (j+1)-th layer within the n-th stack with taking into account the interference of the waves reflected from interfaces between the different layers. Thus, R_(x)(0,n) and R_(Y)(0,n) correspond to the reflectivity from the n-th stack in the upper layer of the measured structure for TM and TE polarizations, respectively.

The complex coefficients σ_(x)(j,n) and σ_(Y)(j,n) show both the attenuation and phase shift of the TM or TE light within the n-th stack of the j-th layer: the real part of σ describes the phase shift, and the imaginary part of σ describes the attenuation coefficient.

Index j=0 corresponds to the superstrate layer L₀, Index j=K+1 corresponds to the substrate layer L₁. Thus, we have:

R_(X)(K + 1, n) = 0 R_(Y)(K + 1, n) = 0 σ_(X)(K + 1, n) = 0 σ_(Y)(K + 1, n) = 0 ε_(X)(0, n) = ε_(L0) ε_(Y)(0, n) = ε_(L0) ε_(X)(K + 1, n) = ε_(L1) ε_(Y)(K + 1, n) = ε_(L1)

Referring to FIG. 12, there is illustrated a patterned structure 710 composed of grid cycles 712 (two such cycles being shown in the figure), which are formed on a substrate layer L₁ and are covered by a superstrate air layer L₀. In this specific example, the Cu-polishing during the dual Damascene process is considered. The grid cycle 712 consists of two stacks 712 a and 712 b of equal widths ΔX₁ and ΔX₂, respectively, such that ΔX₁=ΔX₂=Λ/2, Λ being the pitch.

In this case, the number K of stack layers is 3, and the total number of layers including the substrate and superstrate layers is 5, i.e., j=0,1,2,3,4. The condition of j=4 corresponds to the substrate layer L₁ (Si). The condition of j=3 corresponds to the lower oxide layer (SiO₂) with the thickness of about 5000A. The condition of j=2 corresponds to an etch stop layer (Si₃N₄) with the thickness of about 1000A. The condition of j=1 is the metal-containing layer (Cu in the first stack, and SiO₂ in the second stack) with the thickness of about 5000A. The condition of j=0 corresponds to the superstrate layer L₀ (water/air layer).

In this case, we have the trivial results for the layers 0, 2, 3, and 4, as follows:

ε_(X)(4, 1) = ε_(X)(4, 2) = ε(Si) ε_(Y)(4, 1) = ε_(Y)(4, 2) = ε(Si) ε_(X)(3, 1) = ε_(X)(3, 2) = ε(SiO₂) ε_(Y)(3, 1) = ε_(Y)(3, 2) = ε(SiO₂) ε_(X)(2, 1) = ε_(X)(2, 2) = ε(Si₃N₄) ε_(Y)(2, 1) = ε_(Y)(2, 2) = ε(Si₃N₄) ε_(X)(0, 1) = ε_(X)(0, 2) = ε(Air) = 1 ε_(Y)(0, 1) = ε_(Y)(0, 2) = ε(Air) = 1 For the metal-containing layer (j=1) with periodic structure (grating) we have the following results: _(εx)(1)=1/(0.5/_(ε)(Cu)+0.5/_(ε)(SiO₂))−average permittivity for TM polarization _(εx)(1,1)=_(εx)(1)+_(α)(λ)[ε(Cu)−_(εx)(1)] _(εx)(1,2)=_(εx)(1,)+_(α)(λ)[_(ε)(SiO₂)−_(εx)(1)] _(εY)(1)=0.5 _(ε)(Cu)+0.5 _(ε)(SiO₂)−average permittivity for TE polarization _(εY)(1,1)=_(εY)(1)+_(α)(λ)[_(ε)(Cu)−_(εY)(1)] _(εY)(1,2)=_(εY)(1)+₆₀ (λ)[_(ε)(SiO₂)−_(εY)(1)]

Here, α(λ) is the coefficient indicative of the “mixing” of two cases: Λ>λ and Λ<λ. The value α(λ) depends on the ratio between the pitch and the wavelength. Let us present the coefficient α(λ) as a linear function of wavelength λ (in nm): α(λ)=α₅₀₀+(α₉₀₀−α₅₀₀)(λ−500)/(900−500) If α(λ)<0, then α(λ)=0. If α(λ)>1, then α(λ)=1. Here, α₅₀₀ and α₉₀₀are the values of the coefficient α for the wavelength λ equal to 500 nm and 900 nm, respectively. For measured wavelength ranging between 500 nm and 900 nm, the approximate values of α₅₀₀ and α₉₀₀ for different values of pitch may be as follows: For Λ<0.20 μm, _(α500)=0.0 and _(α900)=0.0(Λ<λ); For Λ=0.32 μm, _(α500)≈0.1 and _(α900)≈0.2; For Λ=0.70 μm, _(α500)≈0.5 and _(α900)≈0.7; For Λ=1.50 μm, _(α500)≈0.7 and _(α900)≈0.9; For Λ>4.00 μm, _(α500)=1.0 and _(α900)=1.0(Λ>λ)

The above values are presented as non-limiting examples only. It should be understood that the values of α₅₀₀ and α₉₀₀ are considered as fitting parameters, because they depend not only on the ratio Λ/λ, but on the geometry of the structure as well (metal density, optical constants, substances, exact stack structure, roughness of the interface between the different stacks, etc.). These parameters α₅₀₀ and α₉₀₀ should be optimized once per each structure, and after fixing these optimized values, they should be maintained constant during the measurements.

In the previous examples of FIG. 11 and 12, one-dimensional periodicity in the layers was considered. To measure the oxide thickness in DRAM applications, however, the spectra from layers with two dimensional structure within the layers have to be analyzed. To this end, the previous one-dimensional consideration can be easily generalized to the case of the alternative (perpendicular) lines (metal lines) in different layers, i.e. the two-dimensional stack.

FIGS. 13A–13C illustrate a structure 810 of the typical DRAM application. FIG. 13A is a top view of the structure. FIGS. 13B and 13C are cross-sectional views taken along lines A—A and B—B, respectively.

Lines L⁽¹⁾ correspond to the STI (Shallow Trench Isolation), i.e., lines of SiO₂ within a lower level L₁ of Si substrate, and lines L⁽²⁾ correspond to the DRAM gate stack in the middle layer L₂ (SiN/Oxide/WSi/Poly/Gate Oxide stack). An upper layer L₃ of oxide is covered by an ambient air/water layer L₀.

In this case, the average permittivities ε_(x)(j) and ε_(Y)(j) of each layer for electric field vector being parallel to the X-axis and Y-axis, respectively, can be calculated. If the lines in the j-th layer are parallel to Y-axis, we have:

$\begin{matrix} {\left\lbrack {ɛ_{x}(j)} \right\rbrack^{- 1} = {\sum\limits_{n = 1}^{N}\;{\sum\limits_{m = 1}^{M}\;{\left\lbrack {ɛ_{n\; m}(j)} \right\rbrack^{- 1}\frac{\Delta\; X_{n}\Delta\; Y_{m}}{\Lambda_{X}\Lambda_{Y}}}}}} \\ {{ɛ_{Y}(j)} = {\sum\limits_{n = 1}^{N}\;{\sum\limits_{m = 1}^{M}\;{{ɛ_{n\; m}(j)}\frac{\Delta\; X_{n}\Delta\; Y_{m}}{\Lambda_{X}\Lambda_{Y}}}}}} \end{matrix}$ If the lines in the j-th layer area are parallel to the X-axis, we have:

$\begin{matrix} {{ɛ_{X}(j)} = {\sum\limits_{n = 1}^{N}\;{\sum\limits_{m = 1}^{M}\;{{ɛ_{n\; m}(j)}\frac{\Delta\; X_{n}\Delta\; Y_{m}}{\Lambda_{X}\Lambda_{Y}}}}}} \\ {\left\lbrack {ɛ_{Y}(j)} \right\rbrack^{- 1} = {\sum\limits_{n = 1}^{N}\;{\sum\limits_{m = 1}^{M}\;{\left\lbrack {ɛ_{n\; m}(j)} \right\rbrack^{- 1}\frac{\Delta\; X_{n}\Delta\; Y_{m}}{\Lambda_{X}\Lambda_{Y}}}}}} \end{matrix}$ In the above equations, N is the number of stacks extending along the X-axis; M is the number of stacks extending along the Y-axis; ΔX_(n) is the width of the n-th stack along the X-axis; ΔY_(m) is the width of the m-th stack along the Y-axis; ε_(nm)(j) is the permittivity of a cell in the j-th layer, the cell belonging to the n-th stack along the X-axis and the m-th stack along the Y-axis; Λ_(X) and Λ_(Y) are the pitches along the X-axis and Y-axis, respectively.

The structure pitches Λ_(Y) and Λ_(x) are determined as follows:

${\Lambda\; y} = {\sum\limits_{m = 1}^{M}\;{\Delta\; Y_{m}}}$

For the specific example of FIGS. 13A–13B, we have: N=2; M=2; _(Δ)X₁=0.3 nm; _(Δ)X₂=0.2 nm; _(Δ)Y₁=0.1 nm; _(Δ)Y₂=0.1 nm; _(Λx)=_(Δ) X ₁+_(Δ) X ₂=0.5 nm; _(ΛY)=_(Δ) Y ₁+_(Δ) Y ₂=0.2 nm

The total reflectivity R_(ror) has the following form:

$R_{TOT} = {{\psi{{\sum\limits_{n = 1}^{N}\;{\sum\limits_{m = 1}^{M}\;{{R_{X}\left( {0,n,m} \right)}\frac{\Delta\; X_{n}\Delta\; Y_{m}}{\Lambda_{X}\Lambda_{Y}}}}}}^{2}} + \mspace{85mu}{\left( {1 - \psi} \right){{\sum\limits_{n = 1}^{N}\;{\sum\limits_{m = 1}^{M}{{R_{Y}\left( {0,n,m} \right)}\frac{\Delta\; X_{n}\Delta\; Y_{m}}{\Lambda_{X}\Lambda_{Y}}}}}\; }^{2}}}$ wherein ψ describes the polarization of light, the condition ψ=0 corresponding to light polarized along Y-axis, the condition ψ=1 corresponding to light polarized along the X-axis, the condition ψ=0.5 corresponding to unpolarized light.

The reflectivity amplitudes R_(x)(j,n,m) and R_(Y)(j,n,m) from different two dimensional stacks (n,m) can be calculated using the above equations, but with the permittivities ε_(x)(j,n,m) and ε_(Y)(j,n,m), wherein index j describes the layer, indices n and m describe those stacks along the X- and Y-axes to which the (n,m)-substack belongs. The permittivities ε_(x)(j,n,m) and ε_(Y)(j,n,m) are calculated as follows: ε_(x)(j,n,m)=ε_(X)(j)+α(λ)[ε_(nm)(j)−ε_(X)(j)] ε_(Y)(j,n,m)=ε_(Y)(j)+α(λ)[ε_(nm)(j)−ε_(Y)(j)]

The measurements can be further optimized by selecting an optical model that takes into account the relation between the wavelength λ of incident light and the space size between two locally adjacent metal lines of the patterned structure (i.e., the width W₂ of the stack 12B in the example of FIGS. 1A and 1B or the width W₂ of the stack 712B in the example of FIG. 12). The model described below with reference to FIG. 14 keeps to be valid for both the case of W₂>λ and the case of W₂˜λ (e.g., 0.5λ<ΔX₁<∞).

A patterned structure 910 shown in FIG. 14 is generally similar to the structure 710 of FIG. 12. The structure 910 formed on the substrate layer L₁ is composed of a plurality of cells (grid cycles) 912 (formed, for example, of straight lines of metal in a dielectric matrix), and is covered by the upper, ambient, “superstrate” layer L₀.

In the present example, the normal incidence of light onto the patterned structure 910 (e.g., straight lines of metal in dielectric matrix) is considered. Here, the Z-axis is perpendicular to the surface of the structure (i.e., parallel to the direction of propagation of incident light towards the structure), the X-axis is perpendicular to the lines of metal (i.e., elements of the pattern), and the Y-axis is parallel to the metal lines. A layer L_(m) is the opaque regions containing layer and consists of opaque regions (lines) 913A and transparent regions 913B. In the present example, the opaque lines are metal lines (copper, tungsten, aluminum, etc.), the transparent lines are oxide lines (or low-k dielectric materials). Each transparent oxide line in the layer L_(m) is surrounded by the opaque metal lines (Cu lines), and therefore forms a wave-guide. The eigen function E(x,y,z) representing the electric field distribution within a wave-guide (electric field vector being parallel to the lines) has the form:

$\left. \left. {{E\left( {x,y,z} \right)} = {{{\psi(x)}\left\lbrack {A\mspace{14mu}{\exp\left( {{+ {\mathbb{i}}}\frac{2\pi}{\lambda}n_{eff}z} \right)}{\exp\left( {{- \frac{2\pi}{\lambda}}k_{eff}z} \right)}} \right)} + \mspace{130mu}{B\;{\exp\left( {{- {\mathbb{i}}}\frac{2\pi}{\lambda}n_{eff}z} \right)}{\exp\left( {{+ \frac{2\pi}{\lambda}}k_{eff}z} \right)}}}} \right) \right\rbrack$ wherein ψ(x) has the sinus-like shape within the oxide lines and decreases rapidly (on the scale of the metal skin depth δ) within the metal lines; n_(eff)(λ) is the effective refractive index n_(eff)(λ); and k_(eff)(λ) is the effective extinction coefficient.

The effective refractive index n_(eff)(λ) and effective extinction coefficient k_(eff)(λ) for the main mode have the form:

${{n_{eff}(\lambda)} + {{\mathbb{i}}\;{k_{eff}(\lambda)}}} = \sqrt{\left( {{n_{Oxide}(\lambda)} + {{\mathbb{i}}\;{k_{Oxide}(\lambda)}}} \right)^{2} - \left( \frac{\lambda}{2\left( {W_{2} + {2{\delta(\lambda)}}} \right)} \right)^{2}}$ wherein n_(Oxide)(λ) and k_(Oxide)(λ) are the refraction index and extinction coefficient, respectively, of the transparent lines (oxide in the present example), and δ(λ) is a metal skin depth.

The skin depth δ(λ) depends on the wavelength, but in the first approximation it can be kept constant for the given wavelength range. In the present example, δ is equal to the copper skin depth δ_(Cu). Parameter δ should be optimized for each metal (copper, tungsten, aluminum, etc.) and a specific application, because the optical properties of the metal lines depend on their composition and preparation. For example, the skin depth for the electroplated copper metal can be slightly different from that of the PVD deposited copper seed layer. The value of δ also depends on the wavelength range. The calculation technique of the present invention is applicable for Al and W lines as well, by using the corresponding value of δ (δ_(A1) for aluminum or δw for tungsten).

According to this model, where the lines 613B of transparent material are surrounded by the opaque (metal) lines 613A (layer L_(m)) the effective optical constants n_(eff)(λ) and k_(eff)(λ) are utilized, rather than the optical constants of the transparent material (n_(Oxide) and k_(Oxide) in the present example).

The present example utilizes the previously described calculation techniques but using the effective optical constants in the stack 712B for layers L_(m) with metal lines satisfying the following condition: ε₂(J)=(n _(eff)(λ)+ik _(eff)(λ))² wherein J is a number of transparent layer in the stack 712B, which is surrounded by the metal lines.

Turning back to the previously described approach, the reflectivity amplitude τ₁=R₁(0) from the stack 712A and τ₂=R₂(0) from the stack 712B were obtained using the following recurrent expressions:

${R_{1}(j)} = \frac{{X_{1}(j)} + {{R_{1}\left( {j + 1} \right)}{\exp\left\lbrack {{- 2}{\mathbb{i}}\;{\sigma_{1}\left( {j + 1} \right)}} \right\rbrack}}}{1 + {{X_{1}(j)}{R_{1}\left( {j + 1} \right)}{\exp\left\lbrack {{- 2}{\mathbb{i}}\;{\sigma_{1}\left( {j + 1} \right)}} \right\rbrack}}}$ for   j = K, K − 1, … , 1, 0. ${R_{2}(j)} = \frac{{X_{2}(j)} + {{R_{2}\left( {j + 1} \right)}{\exp\left\lbrack {{- 2}{\mathbb{i}}\;{\sigma_{2}\left( {j + 1} \right)}} \right\rbrack}}}{1 + {{X_{2}(j)}{R_{2}\left( {j + 1} \right)}{\exp\left\lbrack {{- 2}{\mathbb{i}}\;{\sigma_{2}\left( {j + 1} \right)}} \right\rbrack}}}$ for   j = K, K − 1, … , 1, 0. ${\chi_{1}(j)} = {{\frac{\sqrt{ɛ_{1}(j)}\; - \sqrt{ɛ_{1}\left( {j + 1} \right)}}{\sqrt{ɛ_{1}(j)} - \sqrt{ɛ_{1}\left( {j + 1} \right)}}\mspace{25mu}{\chi_{2}(j)}} = \frac{\sqrt{ɛ_{2}(j)} - \sqrt{ɛ_{2}\left( {j + 1} \right)}}{\sqrt{ɛ_{2}(j)}\sqrt{ɛ_{2}\left( {j + 1} \right)}}}$ ${\sigma_{1}(j)} = {{\frac{2\pi}{\lambda}{d_{1}(j)}\sqrt{ɛ_{2}(j)}\mspace{110mu}{\sigma_{2}(j)}} = {\frac{2\pi}{\lambda}{d_{2}(j)}\sqrt{ɛ_{2}(j)}}}$

These can be used in the above equation for R_(τOτ):

$\begin{matrix} {R_{TOT} = {{\left\{ {{{{r_{1} \cdot A_{1P}} + {r_{2} \cdot A_{2P}}}}^{2} + {{{r_{1} \cdot A_{1S}} + {r_{2} \cdot A_{2S}}}}^{2}} \right\} \cdot \frac{\gamma}{2}} +}} \\ {\left\{ {{{r_{1}}^{2} \cdot A_{1P}^{2}} + {{r_{2}}^{2} \cdot A_{2P}^{2}} +} \right.r_{1}{\left. ^{2}{{\cdot A_{1S}^{2}} + {{r_{2}}^{2} \cdot A_{2S}^{2}}} \right\} \cdot \frac{1 - \gamma}{2}}} \end{matrix}$ constituting the theoretical data obtained by this optical model.

The above-described approach is based on the reflectivity amplitudes _(χ1)(j) and _(χ2)(j) that do not take into account the interference of the waves reflected from different layers. They described the reflectivity from the interface between the j-th and (j+1)-th substances only. In other words, the reflectivity amplitudes used in the previously described models correspond to the reflectivity from the interface of two semi-infinite volumes with the permittivities _(ε1)(j) and _(ε1)(j+1) for the stack 712A, and _(ε2)(j) and _(ε2)(j+1) for the stack 712B.

In the present example, the reflectivity amplitudes R₁(j) and R₂(j) describe the reflectivity from the (j+1)-th layer with taking into account the effect of interference between the waves reflected from interfaces between the different layers. Thus, R₁(0) and R₂(0) correspond to the reflectivity from the upper layer of the measured structure for stacks 912A and 912B, respectively.

The complex coefficients _(σ1)(j) and _(σ2)(j) show both the attenuation and the phase shift of light within the j-th layer of stacks 712A and 712B. While the real part of the coefficient σ describes the phase shift, the imaginary part thereof describes the attenuation coefficient.

Index j=0 corresponds to the superstrate, and index j=K+1 corresponds to the substrate:

R₁(K + 1) = 0 R₂(K + 1) = 0 σ₁(K + 1) = 0 σ₂(K + 1) = 0 ε₁(0) = ε_(SUPERSTRATE) ε₂(0) = ε_(SUPERSTRATE) ε₁(K + 1) = ε_(SUBSTRATE) ε₂(K + 1) = ε_(SUBSTRATE)

Thus, by taking into account the relation between the wavelength of incident light and the structure geometry (pitch or space size between the opaque lines), the mostly preferred optical model can be selected accordingly and applied to carry out measurements of the structure parameters.

Those skilled in the art will readily appreciate that many modifications and changes may be applied to the invention as hereinbefore exemplified without departing from its scope defined in and by the appended claims. For example, the patterned structure may comprise any number of cells, each cell being formed of any number of stacks. In the method claims that follow, characters, which are used to designate claim steps, are provided for convenience only and do not apply any particular order of performing the steps. 

1. A method of optical measurements on a patterned structure having an array of spaced-apart metal-containing regions located within at least one substantially transparent layer, the method comprising: a) providing an optical model, based on features of a structure having an array of spaced-apart metal-containing regions located within at least one substantially transparent layer, which features are defined by a certain process of the structure manufacturing, the optical model being configured for determining theoretical data representative of spectral characteristics of light returned from the structure and for calculating a thickness of the metal-containing region of the structure; b) applying optical measurements to said patterned structure, by illuminating it with incident radiation of a preset substantially wide wavelength range, detecting light returned from the patterned structure, and obtaining measured data representative of spectral characteristics of the returned light within said wavelength range; c) processing and analyzing the measured data using said theoretical data, to determine a thickness of the metal-containing region of the patterned structure under measurements.
 2. The method according to claim 1, wherein said processing and analyzing comprises determining a relation between the measured data from the metal-containing regions and at least one substantially transparent layer.
 3. The method according to claim 2, wherein said relation is indicative of the thickness of the metal-containing regions of the patterned structure.
 4. The method according to claim 1, wherein said model is selected to have the thickness of the metal-containing regions of the patterned structure as a variable model parameter.
 5. The method according to claim 1, wherein said structure is a semiconductor wafer structure.
 6. The method according to claim 5, wherein said measured data is indicative of the thickness of the metal-containing regions comprising copper.
 7. The method according to claim 6, wherein said measured data is indicative of the thickness of the metal-containing regions within multi-layer regions.
 8. The method according to claim 1, wherein the optical model being representative of a polarazation effect due to interaction of the incident radiation with the metal-containing regions.
 9. The method according to claim 8, wherein the detecting light returned from the optical model being representative of a polarization effect due to interaction of the incident radiation with the metal-containing regions.
 10. The method according to claim 1, wherein the features of the structure, on which the optical model is based, comprise nominal values of the thickness parameters to be measured.
 11. The method according to claim 1, wherein said processing and analyzing comprises: comparing the theoretical data with the measured data and providing data indicative of the relation between the measured and theoretical data.
 12. The method according to claim 1, wherein the optical measurements are applied to a test site representing a test pattern similar to that of the structure under measurements in design rules and layer stacks.
 13. The method according to claim 1, wherein the uppermost layer is a layer to be removed during the manufacturing process.
 14. The method according to claim 1, wherein said manufacturing process includes Chemical Mechanical Planarization (CMP).
 15. The method according to claim 14, wherein the uppermost layer is a layer to be removed during the CMP.
 16. The method of claim 1, utilizing a spectrometric measurement system integrated with a processing tools arrangement carrying out the manufacturing process.
 17. The method of claim 1, for controlling the manufacturing process applied to the structure.
 18. The method of claim 1, for controlling the manufacturing process applied to the structure using an integrated spectrometric measurement system.
 19. The method of claim 14, for controlling the CMP process applied to the structure.
 20. The method of claim 19, utilizing a spectrometric measurement system integrated with the CMP tools arrangement.
 21. The method according to claim 9, wherein the optical model comprises features defined prior to the measurements.
 22. The method according to claim 9, wherein the features of the structure, on which the optical model is based, comprise nominal values of the thickness parameters to be measured.
 23. The method according to claim 9, wherein said processing and analyzing comprises: comparing the theoretical data with the measured data and providing data indicative of the relation between the measured and theoretical data.
 24. The method according to claim 9, wherein the uppermost layer is a layer to be removed during the manufacturing process.
 25. The method according to claim 9, wherein said manufacturing process includes Chemical Mechanical Planarization (CMP).
 26. The method according to claim 25, wherein the uppermost layer is a layer to be removed during the CMP.
 27. An apparatus for optical measurements in a patterned structure having an array of spaced-apart metal-containing regions arranged within an insulator layer, the apparatus comprising: a spectrometer for detecting a light response of a structure to incident light of a preset substantially wide wavelength range and for providing measured data representative of spectral characteristics of said light response within said wavelength range; and a processor unit coupled to the spectrometer, the processor being operable for applying an optical model based on features of a structure having an array of spaced-apart metal-containing regions located within an insulator layer, said features being defined by a certain process of the structure manufacture, to determine theoretical data representative of spectral characteristics of light returned from the structure and for calculating a thickness parameter of the metal-containing region of the structure, and processing and analyzing the measured data, associated with the patterned structure, using the theoretical data and determining a thickness of the metal-containing region of said patterned structure under measurements.
 28. The apparatus according to claim 27, wherein said spectrometer comprises a spectrometric detector and a variable aperture stop located in the optical path of light reaching the detector, the diameter of the aperture stop being variable in accordance with the pattern in the structure under measurements.
 29. The apparatus according to claim 27, wherein said the processor unit comprising a pattern recognition software for locating the measurements.
 30. The apparatus according to claim 27, wherein said spectrometer comprises a polarizer. 